Exploration server for computer science research in Lorraine

Warning: this site is under development.
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Continuous Speech Recognition Using Dynamic Bayesian Networks: A Fast Decoding Algorithm

Internal identifier: 006C23 (Main/Exploration); previous: 006C22; next: 006C24

Authors: Murat Deviren [France]; Khalid Daoudi [France]

Source: Studies in Fuzziness and Soft Computing [ISSN 1434-9922]; 2004

RBID : ISTEX:CAECBF3D332D0BEC13E478C9A063976DF7DCCF7C

Abstract

State-of-the-art automatic speech recognition systems are based on probabilistic modeling of the speech signal using Hidden Markov Models (HMMs). Recent work has focused on using the dynamic Bayesian network (DBN) framework to construct new acoustic models that overcome the limitations of HMM-based systems. In this line of research, we proposed a methodology to learn the conditional independence assertions of acoustic models based on structural learning of DBNs. In previous work, we evaluated this approach on simple isolated and connected digit recognition tasks. In this paper, we evaluate our approach on a more complex task: continuous phoneme recognition. For this purpose, we propose a new decoding algorithm based on dynamic programming. The proposed algorithm decreases the computational complexity of decoding and hence enables the application of the approach to complex speech recognition tasks.

URL: https://api.istex.fr/ark:/67375/HCB-X8H70WTR-P/fulltext.pdf
DOI: 10.1007/978-3-540-39879-0_16


Affiliations: INRIA-LORIA, Speech Group, B.P. 101, 54602, Villers lès Nancy, France (both authors)


The document in XML format

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Continuous Speech Recognition Using Dynamic Bayesian Networks: A Fast Decoding Algorithm</title>
<author>
<name sortKey="Deviren, Murat" sort="Deviren, Murat" uniqKey="Deviren M" first="Murat" last="Deviren">Murat Deviren</name>
</author>
<author>
<name sortKey="Daoudi, Khalid" sort="Daoudi, Khalid" uniqKey="Daoudi K" first="Khalid" last="Daoudi">Khalid Daoudi</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:CAECBF3D332D0BEC13E478C9A063976DF7DCCF7C</idno>
<date when="2004" year="2004">2004</date>
<idno type="doi">10.1007/978-3-540-39879-0_16</idno>
<idno type="url">https://api.istex.fr/ark:/67375/HCB-X8H70WTR-P/fulltext.pdf</idno>
<idno type="wicri:Area/Istex/Corpus">003013</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Corpus" wicri:corpus="ISTEX">003013</idno>
<idno type="wicri:Area/Istex/Curation">002F74</idno>
<idno type="wicri:Area/Istex/Checkpoint">001832</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">001832</idno>
<idno type="wicri:doubleKey">1434-9922:2004:Deviren M:continuous:speech:recognition</idno>
<idno type="wicri:Area/Main/Merge">006F27</idno>
<idno type="wicri:Area/Main/Curation">006C23</idno>
<idno type="wicri:Area/Main/Exploration">006C23</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Continuous Speech Recognition Using Dynamic Bayesian Networks: A Fast Decoding Algorithm</title>
<author>
<name sortKey="Deviren, Murat" sort="Deviren, Murat" uniqKey="Deviren M" first="Murat" last="Deviren">Murat Deviren</name>
<affiliation wicri:level="3">
<country xml:lang="fr">France</country>
<wicri:regionArea>INRIA-LORIA, Speech Group, B.P. 101, 54602, Villers lès Nancy</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Villers lès Nancy</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Daoudi, Khalid" sort="Daoudi, Khalid" uniqKey="Daoudi K" first="Khalid" last="Daoudi">Khalid Daoudi</name>
<affiliation wicri:level="3">
<country xml:lang="fr">France</country>
<wicri:regionArea>INRIA-LORIA, Speech Group, B.P. 101, 54602, Villers lès Nancy</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Villers lès Nancy</settlement>
</placeName>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s" type="main" xml:lang="en">Studies in Fuzziness and Soft Computing</title>
<idno type="ISSN">1434-9922</idno>
<idno type="eISSN">1860-0808</idno>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">1434-9922</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">State-of-the-art automatic speech recognition systems are based on probabilistic modeling of the speech signal using Hidden Markov Models (HMMs). Recent work has focused on using the dynamic Bayesian network (DBN) framework to construct new acoustic models that overcome the limitations of HMM-based systems. In this line of research, we proposed a methodology to learn the conditional independence assertions of acoustic models based on structural learning of DBNs. In previous work, we evaluated this approach on simple isolated and connected digit recognition tasks. In this paper, we evaluate our approach on a more complex task: continuous phoneme recognition. For this purpose, we propose a new decoding algorithm based on dynamic programming. The proposed algorithm decreases the computational complexity of decoding and hence enables the application of the approach to complex speech recognition tasks.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>France</li>
</country>
<region>
<li>Grand Est</li>
<li>Lorraine (région)</li>
</region>
<settlement>
<li>Villers lès Nancy</li>
</settlement>
</list>
<tree>
<country name="France">
<region name="Grand Est">
<name sortKey="Deviren, Murat" sort="Deviren, Murat" uniqKey="Deviren M" first="Murat" last="Deviren">Murat Deviren</name>
</region>
<name sortKey="Daoudi, Khalid" sort="Daoudi, Khalid" uniqKey="Daoudi K" first="Khalid" last="Daoudi">Khalid Daoudi</name>
</country>
</tree>
</affiliations>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 006C23 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 006C23 | SxmlIndent | more
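
As a complement to the Dilib commands, the XML record shown on this page can also be inspected with a general-purpose parser. The following Python sketch is not part of Dilib. It assumes the record is available as text, and it binds the undeclared `wicri:` prefix to a placeholder namespace URI (`urn:example:wicri`, a hypothetical value), since strict XML parsers reject undeclared prefixes. The function names `parse_record` and `summarize` and the shortened `SAMPLE` record are illustrative only.

```python
import xml.etree.ElementTree as ET

# Hypothetical placeholder URI: the record uses "wicri:" without declaring it.
WICRI_NS = "urn:example:wicri"


def parse_record(xml_text):
    """Parse a record after declaring the wicri prefix on the root element."""
    patched = xml_text.replace(
        "<record>", '<record xmlns:wicri="%s">' % WICRI_NS, 1
    )
    return ET.fromstring(patched)


def summarize(record):
    """Collect the title, author names, and DOI from the TEI header."""
    title = record.findtext(".//titleStmt/title")
    authors = [name.text for name in record.findall(".//titleStmt/author/name")]
    doi = record.findtext(".//publicationStmt/idno[@type='doi']")
    return {"title": title, "authors": authors, "doi": doi}


# Shortened copy of the record, kept only to make the sketch self-contained.
SAMPLE = """<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Continuous Speech Recognition Using Dynamic Bayesian Networks: A Fast Decoding Algorithm</title>
<author>
<name sortKey="Deviren, Murat">Murat Deviren</name>
</author>
<author>
<name sortKey="Daoudi, Khalid">Khalid Daoudi</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="doi">10.1007/978-3-540-39879-0_16</idno>
</publicationStmt>
</fileDesc>
</teiHeader>
</TEI>
</record>"""

if __name__ == "__main__":
    info = summarize(parse_record(SAMPLE))
    for key, value in info.items():
        print(key, ":", value)
```

In practice the text passed to `parse_record` would come from a saved copy of the record rather than from the inline `SAMPLE` string.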

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:CAECBF3D332D0BEC13E478C9A063976DF7DCCF7C
   |texte=   Continuous Speech Recognition Using Dynamic Bayesian Networks: A Fast Decoding Algorithm
}}

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022